This corpus will represent playlists with generally similar songs, but that are either classified as ‘singing in the shower’ playlists or pop playlists. By comparing these groups, I will try to see if there is a measurable difference in certain attributes measured by Spotify, to define what it means to be a ‘singing in the shower’ playlist. I chose the most-followed ‘singing in the shower’ playlists on Spotify, so I believe it should be a good representation of music that people do enjoy while showering.
The ‘singing in the shower’ playlists used:
The pop playlists used:
For more information on the playlists used, take a look at the Appendix tab.
Based on the Random Forest classification, we can see that the most important differentiating feature is valence, with MeanDecreaseGini ~ 23. The next most important features are speechiness, with MeanDecreaseGini ~ 17, and c05, c09, and B (all with MeanDecreaseGini ~16).
I use these top features in the kNN classification, which we can see the results of in the next tab. I also use these top features throughout the storyboard, when choosing track-level features to analyze and songs to compare that are representative of each set of playlists.
This kNN classifier, even with modified and optimized feature values from the random forest classification, only achieves an accuracy of 0.55. This is only a slightly better performance than if one were to guess the playlist with a coin flip (and achieve ~0.5 accuracy). This is even the case when using the full dataset for both sets of playlists, and when using all feature values for the classification. So, it seems to indicate that it’s quite difficult to classify a song from both sets of playlists as being from either a ‘singing in the shower’ playlist or a pop playlist.
These results show a trend that we will see throughout the storyboard as I explore other features in depth: it’s very difficult to differentiate ‘singing in the shower’ playlists from pop playlists, and as such, it’s increasingly difficult to define ‘singing in the shower’ playlists, without really being able to separate the two.
The hierarchical clusterings are computed based on Euclidean distances, using a minimax linkage function.
In the final clustering graph, blue represents ‘singing in the shower’ songs and red represents pop songs.
What’s remarkable when looking at the clustering of the ‘singing in the shower’ playlists and the pop playlists is how similar they are in terms of groupings. They both seem to have 3-4 main clusters, and the structure of the clusters is quite similar, too.
Further, when looking at the clustering of both sets of playlists combined (with songs randomly chosen from each playlist), we see that there aren’t any very large clusterings with mostly ‘singing in the shower’ or pop music. There are some smaller clusters that are mostly one or the other playlist, but in large part, we see a general mix of both sets of playlists in most of the clusters. Thus, yet another method has failed to really differentiate ‘singing in the shower’ and pop playlists.
Thus far, overarching techniques (classification and clustering) have been unable to provide us with concrete insights on the differences between the two sets of playlists. So, next we will look at lower-level features in the hope that they may shed some light on playlist group differences, on a more microscopic scale.
Generally, when looking at pop music, a valence vs. energy plot is used for preliminary analysis, which is why I have chosen this plot as a reference for what I expect the music distributions to follow. Pop music is generally in the upper right quadrant (high energy and high valence), a bit less in the upper left quadrant (high energy and low valence), infrequently in the lower left quadrant (low energy and low valence), and rarely in the lower right quadrant (low energy and high valence).
Looking at this first scatterplot, we see the greatest concentration of music, for both playlists, in the quadrant with high energy and high valence. This is the distribution I expected from the pop playlists. It also seems valid, given both playlists seem to have a large mix of fast-paced, loud, and cheerful music. As such, we can see our first real similarity in both playlists.
More interestingly, we see that the ‘singing in the shower’ playlists have a wider array of music. Particularly, it seems to have a decent proportion of songs with 0-0.5 valence, as compared to the ‘pop’ playlist. One theory for this is that top charts ‘pop’ music is generally “catchy” because it has a positive and fast beat, but songs that people generally like to sing along to can be happy or more angry, that usually have fast and loud beats. Neither, however, seem to have much music with low energy and low valence, but music in that category tends to be perceived as sadder, which makes it less likely to be considered a ‘pop’ song, and also most likely makes it not as fun to sing in the shower (though this is mere speculation on my behalf).
Since valence and speechiness were classified earlier as the most important differentiating track-level features between both sets of playlists, we will take a closer look at them here.
We can see that most of the music from both playlists is concentrated in the same area, with speechiness values ~0.05 and valence values ~0.5. While both playlists have music on the full spectrum of valence, ‘singing in the shower’ playlists have more songs with speechiness close to 0.6, with higher upper quartile and maximum speechiness values than pop playlists.
For the points above the speechiness = 0.2 line, it’s clear to see that the ‘singing in the shower’ music is more on the low valence side, and the pop music is more on the high valence side (pop music has a higher valence median than shower music).
One possible explanation for these findings is that ‘pop’ songs that are generally the most enjoyed are ones that are more happy, but music that is enjoyed while showering can be sad or depressing too, as long as people can still pretend to be the Beyoncé in their bathroom by “belting out” the tunes. An important category that falls into the latter description is sad, upbeat music, as it makes for good songs to sing along to. Also, since pop music appears on the radio, in clubs, and many other places, it makes sense that the music is mostly sung (with few spoken words), whereas that’s less of a requirement for ‘singing in the shower’ music.
What’s most striking when looking at the densities of four different track-level features for both sets of playlists is how similar the distributions for both look. These plots begin to indicate how people may not find songs from either playlist to be discernible as a shower vs. pop song.
Defining ‘singing in the shower’ playlists through track-level features clearly won’t be sufficient, either. I will need other measures to better define these playlists separate from pop playlists, so I will examine them on an even smaller scale in upcoming tabs, comparing a representative song from each set of playlists.
Since valence and speechiness have the highest rank in terms of feature importance, according to the random forest classification, I chose a song from each set of playlists that I feel is most representative of its set as follows:
The chromagrams were calculated using Euclidean distances. This measure was chosen because it created the clearest chromagrams of all the available distance measures. It will also be used for the next tab for the same reason, as well as for consistency.
In a preliminary analysis of the plots, we see that ‘Colors’ has more dense chroma features than ‘Blank Space’. Both use a large range of pitch classes, and the chromagrams seem quite similar on first glance. That said, when looking at a fine level, some key differences emerge. ‘Colors’ uses pitch classes G#/Ab, F#/Gb, D#/Eb, unlike ‘Blank Space’. On the other hand, ‘Blank Space’ has noticeable use of pitch classes A, G, E (briefly), and D, unlike ‘Colors’. ‘Blank Space’ also has greater magnitude points than ‘Colors’, although both don’t have very clear/clean chromagrams.
The self-similarity matrices were calculated using Euclidean distances.
The differences noted in the previous tab between the most representative song from each playlist are even more noticeable when comparing the chroma and timbre self-similarities matrices of both songs. The most noticeable difference is that ‘Blank Space’ has more distinct points of novelty / changes throughout the song, whereas ‘Colors’ has few unexpected parts of the song (where the biggest change is exactly at the start of the song). ‘Blank Space’ also has a very consistent structure, marked by the very clear grid in both of its self-similarity matrices. ‘Colors’, on the other hand, has a far fuzzier song structure. Additionally, ‘Blank Space’ shows very distinct texture changes that, again, are more unnoticeable for ‘Colors’. That said, both songs show a lot of repetition, as marked by the diagonal lines.
Since we saw some interesting differences in the chroma and timbre self-similarity matrices, we now look to the timbre coefficient values of these sets of playlists to see if they show additional differences we haven’t yet observed.
Timbre coefficients 3, 5, and 7 seem like the best markers for differences in both sets of playlists. For more context, the third looks at the flatness of the songs, the fifth looks at the brightness of the song at different points of time, and the seventh looks at a combination of brightness and songs with a beginning attack.
That said, the differences (especially when looking at coefficients 3 and 5) don’t seem to be that significant. Also, the rest of the coefficients seem to indicate close similarities in timbre values across both sets of playlists, suggesting how similar the ‘singing in the shower’ playlists are to the pop ones.
As the final measure of analysis addressing the question of what a ‘singing in the shower’ playlist is, as compared to a pop one, we unfortunately still have not had fruitful enough results to definitively say that these playlists are different, let alone what these differences would be.
I looked for differences on a categorical scale (classification/clustering), a detailed local scale (track-level features), and a miniscule (song vs. song) scale. There weren’t many differences found between the two sets of playlists, on any of these scales, and all of the differences found were really minor, which leads me to believe that there are no actual differences to find. As such, the best conclusion I can draw from my analyses throughout the storyboard is that ‘singing in the shower’ and pop playlists are, indeed, the same.
While these results are a bit disappointing in that they’re not quite flashy or revolutionary, they are still interesting. They call to question how Spotify classifies its playlists, and begs the question: what other Spotify playlist “genres” are actually unique / novel?
The direct results of this project may not have many benefits, but I do think that the consideration of what makes a specific Spotify playlist distinct/unique could benefit research projects focusing on Spotify (or even other) playlists.
Finally, on a personal note, having computed and experimented with different analytical functions and strategies throughout this project, I now have the foundational tools to ask other meaningful computational musicology questions and seek initial answers.
Singing in the Shower Playlists:
| Playlist Name | ID | Songs |
|---|---|---|
| Songs to Sing in the Shower | 37i9dQZF1DWSqmBTGDYngZ | 70 |
| Shower / sing-a-long | 1rmsEzwr6ZmRNzCUph24vZ | 81 |
| Shower | 1dTgSkYRwILdvmckibB9AP | 440 |
Pop Playlists:
| Playlist Name | ID | Songs |
|---|---|---|
| Pop Hits 2000-2018 | 6mtYuOxzl58vSGnEDtZ9uB | 293 |
| Pop Hits Rewind | 0RPcfl1sCsJ03B0bztuKAn | 70 |
| Guilty Pleasures | 37i9dQZF1DX4pUKG1kS0Ac | 151 |
| Mega Hit Mix | 37i9dQZF1DXbYM3nMM0oPk | 75 |